GROWTH DIFFERENTIATION FACTOR PROMOTER AND USES THEREFOR

Field of the Invention

The invention relates to GDF promoters, such as GDF-8 promoters, as well as
methods of using them, e.g., methods of screening for compounds that regulate GDF-8
expression.

Background of the Invention 

GDF-8 is a member of the TGF-β superfamily, which encompasses a large group
of growth and differentiation factors that play important roles in regulating embryonic
development and in maintaining tissue homeostasis in adult mammals. GDF-8 appears
to function specifically as a negative regulator of skeletal muscle growth and, therefore,
has potential applications in producing livestock and game animals, such as cows, sheep,
pigs, chickens, turkeys, and fish, which are relatively high in musculature and protein and
low in fat content. In addition, GDF-8 has potential applications in various cell
proliferative and differentiation disorders, especially those involving muscle, nerve and
adipose tissues in both humans and animals. GDF-8 also appears to be involved in
glucose transport and, therefore, has potential applications in the treatment or diagnosis
of glucose transport associated disorders such as diabetes.

Many drug and diet regimens exist which may help increase muscle and protein
content and lower undesirably high fat and/or cholesterol levels, but such treatment is
generally administered after the fact, and is begun only after significant damage has
occurred to the vasculature. Accordingly, it would be desirable to produce animals
which are genetically predisposed to having higher muscle content, without any
ancillary increase in fat levels. U.S. Patent Application number 09/019,070, inventors
Se-Jin Lee and Alexandra C. McPherron, filed February 5, 1998 and entitled Growth
Differentiation Factor-8, also describes the production of GDF-8, as well as potential
uses. This application is also hereby incorporated by reference.

Control of GDF-8 gene expression is highly desirable. The availability of
discrete DNA segments which are capable of conferring either a negative or positive
control capability to known genes in eukaryotic systems is generally lacking in the art.
Isolation of regulatory genetic sequences for GDF-8 is disclosed in the present
invention.

Summary of the Invention

The present invention relates to the molecular regulation of GDF-8 expression 
and, in particular, to the isolation and identification of regulatory sequences of GDF 
gene promoters, such as the GDF-8 gene promoter. 

In one embodiment, the present invention provides the complete nucleotide
sequence and identification of genetic regulatory elements which promote expression of
GDF-8.

In another embodiment, the present invention provides a method of screening for
compounds which regulate GDF-8 expression, for example, by inhibiting or by
stimulating GDF-8 expression.

In yet another embodiment, the present invention provides a DNA expression
construct containing the GDF-8 promoter operatively linked to a gene of interest (GOI),
and a method of expressing a GOI in muscle and other tissues (e.g., tissues in which
GDF-8 is naturally expressed).

Brief Description of the Drawings 

Figures 1A and 1B show the results of a human GDF-8 promoter-luciferase
reporter construct transfection assay in 6 different cell lines.

Figure 2 shows the nucleotide sequence corresponding to the human GDF-8
promoter element (SEQ ID NO:1). The initiation codon, ATG, is underlined in bold
letters. The boxed sequences are the three mutated regions. The numbers represent the
nucleic acid positions upstream of the ATG.
Figure 3A is a schematic representation of various human GDF-8 promoter
reporter constructs. Figure 3B is a graph showing the luciferase activity of these
constructs in RD (human embryonal rhabdomyosarcoma) cells. Luciferase activities are
expressed as percent of control (pGL3-0.65, luciferase reporter plasmid containing 0.65
Kb of sequence upstream of the ATG site). The relative luciferase activities of the
reporter plasmids are normalized to the β-galactosidase activity.

Figure 4A is a schematic representation of various mutated human GDF-8
promoter-luciferase reporter constructs. Figure 4B is a graph showing the luciferase
activity of these constructs in RD cells. Luciferase activities are expressed as percent of
control (pGL3-0.65, luciferase reporter plasmid containing 0.65 Kb of sequence
upstream of the ATG site). The relative luciferase activities of the reporter plasmids are
normalized to the β-galactosidase activity.

Figure 5A shows the double stranded DNA oligonucleotide sequences used in
constructing human GDF-8 promoter-luciferase reporter constructs containing
concatemers of CAAATG and GACAGC sequences. Box 1 sequence corresponds to
SEQ ID NO: 2. Box 3 sequence corresponds to SEQ ID NO: 3. Figure 5B is a graph
showing the luciferase activity of these constructs in RD cells. Luciferase activities are
expressed as percent of control (pGL3-0.65, luciferase reporter plasmid containing 0.65
Kb of sequence upstream of the ATG site). The reporter construct pGL-0.21 is the
parental plasmid of the concatemer reporter constructs. The relative luciferase activities
of the reporter plasmids were normalized to the β-galactosidase activity.

Figure 6 is a graph showing the luciferase activities of the various stable RD
clones containing the pGL3-0.65 luciferase reporter sequence referred to in the
description of Figure 5B, and the effects of TGF-β and TNFα on luciferase reporter
expression.

Figure 7A shows the nucleotide sequence for the mouse GDF-8 promoter region
(SEQ ID NO: 4). Figure 7B shows the nucleotide sequence for the chicken GDF-8
promoter region (SEQ ID NO: 5).

Figure 8 shows an alignment of the nucleotide sequences for human, mouse, pig
and chicken GDF-8 promoter elements (SEQ ID NOS: 6, 7, 8 and 9, respectively)
upstream of the TATAA box (underlined). The CAAATG, CAGACA and GACAGC
sequences are boxed. The shaded areas represent regions of sequence homology.

Detailed Description of the Invention 
The term "GDF promoter," as used herein, refers to nucleotide sequence
elements located upstream of the 5' end of a GDF gene (e.g., a gene encoding GDF-8 or
another related or homologous growth factor) which regulate (e.g., initiate,
upregulate or downregulate) transcription and/or expression of the gene. For example, a
GDF promoter can include sequence elements necessary to initiate gene transcription,
enhancer elements, repressor elements and other cis-acting control elements which
modulate gene expression. Accordingly, the term "GDF promoter" is used herein
interchangeably with the term "GDF regulatory region" or "GDF control region."

An "isolated GDF promoter" refers to a GDF promoter which is removed from
its natural sequence context. For example, an isolated GDF promoter can be a GDF
promoter cloned or otherwise removed from its natural source, and inserted upstream
from the 5' end of a heterologous structural gene, e.g., within an expression vector. The
term "GDF promoter" also refers to nucleotide sequences having sufficient homology to
a GDF promoter that they exhibit one or more functions of the GDF promoter (e.g., drive
transcription of a gene operably linked to the promoter). Generally, GDF promoters are
derived from the 5' flanking region of a GDF gene.

GDF promoters of the invention include those derived from any GDF gene, such
as the GDF-8 gene or a homologous gene (e.g., GDF-11). The term "derived from", as
it is used herein, refers to a source or origin for an isolated GDF promoter of the
invention. For example, a GDF promoter that is "derived from" a particular GDF gene
(e.g., a GDF-8 gene) will be identical or highly homologous in nucleotide sequence to
the GDF promoter of a naturally occurring GDF gene (e.g., a GDF-8 gene). Isolated
GDF promoters of the invention which are "derived from" GDF genes, such as GDF-8
genes, also include those which have been modified by insertion, deletion or substitution
of one or more nucleotides but which retain substantially the same activity or function.

The term "a GDF gene" refers to a GDF gene from any organism naturally possessing the
GDF gene, including, but not limited to, human, chicken, cow, sheep, fish, pig and
mouse. A "GDF gene" also refers to a GDF gene from any piscine, crustacean or
mollusk naturally possessing the GDF gene. In a specific embodiment, the invention
provides a human GDF-8 promoter comprising all or a portion (or portions) of the
nucleotide sequence shown in Figure 2 (SEQ ID NO:1), as well as GDF-8 promoters
from other mammals having regions of homology to the human promoter sequence,
particularly in regions required for activity of the promoter sequence. Generally, this
range of homology is about 60% to 90% or higher. For example, homologous regions
within GDF-8 promoters from human (SEQ ID NO:6), mouse (SEQ ID NO:7), pig
(SEQ ID NO:8) and chicken (SEQ ID NO:9) GDF-8 genes are shown in Figure 8.

Accordingly, GDF promoters of the invention include promoters having a
nucleotide sequence at least 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98%, or more
identical to the nucleotide sequence set forth in SEQ ID NOS: 1, 4, 5, 6, 7, 8 and 9, and
which modulate expression of a gene operably linked to the promoter. To determine the
percent identity of two nucleic acid sequences, the sequences are aligned for optimal
comparison purposes (e.g., gaps can be introduced in one or both of a first and a second
nucleic acid sequence for optimal alignment and non-homologous sequences can be
disregarded for comparison purposes). In a preferred embodiment, the length of a
reference sequence aligned for comparison purposes is at least 30%, preferably at least
40%, more preferably at least 50%, even more preferably at least 60%, and even more
preferably at least 70%, 80%, or 90% of the length of the reference sequence. The
nucleotides at corresponding nucleotide positions are then compared. When a position
in the first sequence is occupied by the same nucleotide as the corresponding position in
the second sequence, then the molecules are identical at that position (as used herein
nucleic acid "identity" is equivalent to nucleic acid "homology"). The percent identity
between the two sequences is a function of the number of identical positions shared by
the sequences, taking into account the number of gaps, and the length of each gap,
which need to be introduced for optimal alignment of the two sequences.
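
By way of illustration only, the position-by-position comparison described above can be sketched in a few lines of Python. The sketch assumes the two sequences have already been aligned (gaps written as "-") and uses one common convention, identical positions divided by the number of alignment columns; the function name and the toy sequences are hypothetical, and alignment programs may count gaps differently.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity of two pre-aligned sequences.

    Convention assumed here: identical positions / alignment columns,
    so every gap column lowers the score. Other conventions exist.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Ignore columns where both sequences carry a gap character.
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == gap and b == gap)
    ]
    if not columns:
        raise ValueError("no alignment columns to compare")
    # A column counts as identical only if both residues match and are not gaps.
    matches = sum(1 for a, b in columns if a == b and a != gap)
    return 100.0 * matches / len(columns)


# Toy, made-up sequences (not taken from any SEQ ID NO).
toy_identity = percent_identity("CAAATGGACA-GC", "CAAATGCACAGGC")
print(f"{toy_identity:.1f}% identical")
```

In this toy alignment, 11 of 13 columns are identical; the single mismatch and the single gap column each count against the score.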

The comparison of sequences and determination of percent identity between two
sequences can be accomplished using a mathematical algorithm. In a preferred
embodiment, the percent identity between two nucleotide sequences is determined using
the GAP program in the GCG software package (available at http://www.gcg.com),
using a NWSgapdna.CMP matrix and a gap weight of 40, 50, 60, 70, or 80 and a length
weight of 1, 2, 3, 4, 5, or 6. In another embodiment, the percent identity between two
nucleotide sequences is determined using the algorithm of E. Meyers and W. Miller
(CABIOS, 4:11-17 (1989)) which has been incorporated into the ALIGN program
(version 2.0), using a PAM120 weight residue table, a gap length penalty of 12 and a
gap penalty of 4.

Accordingly, GDF promoters of the invention can be identified by comparing the
regions 5' of the transcription initiation site of GDF genes to the specific GDF promoter
sequences provided herein (e.g., SEQ ID NO: 1 corresponding to the human GDF-8
promoter) and looking for regions of homology, corresponding to the active promoter
sequences. The specific GDF-8 promoter sequences provided by the present invention
also can be used to screen for homologous sequences from other species using standard
DNA hybridization protocols (e.g., under conditions of high stringency).

GDF promoters of the invention contain DNA sequence elements which ensure
proper binding and activation of RNA polymerase, influence where transcription will
start, and affect the level of transcription. In addition, specific regulatory sequences that
are functional in the regulation (induction and repression) of gene expression responsive
to stimuli or specific chemical species also may be included within the promoter
sequence.

A DNA "coding sequence", "coding region", or a "sequence encoding" a
particular protein is a DNA sequence which is transcribed and translated into a
polypeptide in vitro or in vivo when placed under the control of appropriate regulatory
elements. The boundaries of the coding sequence are determined by a start codon at the
5'-terminus and a translation stop codon at the 3'-terminus. A coding sequence can
include, but is not limited to, cDNA from eukaryotic mRNA, genomic DNA sequences
from eukaryotic (e.g., mammalian, animal, avian etc.) sources, and even synthetic DNA
sequences. A transcription termination sequence will usually be located 3' to the coding
sequence.

The term "reporter gene", as used herein, refers to a gene encoding a protein
which is readily quantifiable or observable. Because gene regulation usually occurs at
the level of transcription, transcriptional regulation and promoter activity are often
assayed by quantitation of gene products. For example, promoter regulation and activity
have often been quantitatively studied by the fusion of the easily assayable E. coli lacZ
gene to heterologous promoters (Casadaban and Cohen (1980) J. Mol. Biol. 138:179-207).
The structural genes for chloramphenicol acetyl transferase (CAT), green
fluorescent protein (GFP), and luciferase are other genes commonly used to detect
activity of a promoter or other regulatory sequence.

The term "tissue-specific expression", as it is used herein, refers to a limited or
characteristic pattern of gene expression among cell types. In other words, expression of
a gene is observed in certain tissues of an organism but not in other tissues. For
example, "muscle-specific" expression of a gene denotes that the gene is expressed in
muscle and perhaps a limited number of other tissues, but is not expressed in all tissues
(e.g., globally).

A "vector" is a replicon, such as a plasmid, phage, or cosmid, to which another
DNA segment may be attached so as to bring about the replication of the attached
segment. An "expression vector" means any DNA vector (e.g., a plasmid vector)
containing the necessary genetic elements for expression of a desired gene, including a
promoter region of the present invention. These elements are "operably linked" to the
gene, meaning that they are located at a position within the vector which enables them to
have a functional effect on transcription of the gene. The regulatory elements need not
be contiguous with the coding sequence, so long as they function to direct the expression
thereof. Thus, for example, intervening untranslated yet transcribed sequences can be
present between a promoter and the coding sequence and the promoter can still be
considered "operably linked" or "in operable linkage to" the coding sequence.

A cell has been "transformed" by exogenous DNA (e.g., a transgene) when such
exogenous DNA has been introduced inside the cell membrane. Exogenous DNA may
or may not be integrated into the chromosomal DNA comprising the genome of the cell.
With respect to eukaryotic cells, though, a stably transformed cell is one in which the
exogenous DNA has become integrated into the chromosome such that it is inherited by
daughter cells through chromosome replication.

A "host cell" is a cell that has been transformed, or is capable of transformation,
by an exogenous nucleic acid molecule.

A "transgene" refers to a nucleic acid which is introduced into a cell. Typically,
the transgene is integrated into the genome of the cell following introduction. The
transgene can encode a protein which is not expressed in the cell or which is expressed
in the cell at low levels or in defective form.

A "transgenic animal" is an animal carrying in its cells at least one transgene.
For example, the transgenic animal can contain in its cells a transgene corresponding to
a gene of another species which has been introduced into the germline of the animal,
such that the introduced gene is present in all somatic and germline cells.

GDF promoters of the invention can vary in size. Generally, the promoter
sequence spans or is located within approximately 500 bases to 3000 bases of sequence
in the 5' direction (or upstream) to the site of transcription initiation. However, the
promoter can include sequences out to approximately 4000 bp or further 5' to the site of
transcription initiation. When employed in the context of a heterologous structural gene,
the optimal location of the GDF promoter with respect to the transcription initiation site
can vary. Generally, the same benefit will be obtained when the GDF promoter is
located anywhere up to about 300 nucleotides or more upstream from the transcription
initiation site. However, in a preferred embodiment, the GDF promoter is located within
150 nucleotides of the transcription initiation site.

The majority of promoters control initiation of transcription in one direction
only. Therefore, in order to be under the control of a GDF promoter of the invention, a
structural gene generally must be located downstream (in the 3' direction) of the GDF
promoter and in the correct orientation with respect to the promoter. One or several
genes may be under the control of a single GDF promoter or, conversely, one or more
GDF promoters may control a single structural gene.

In one embodiment, the GDF promoter regulates expression of a gene
operatively linked to the promoter by changing the ability of RNA polymerase to bind to
DNA sequences within the GDF promoter. For example, a regulatory protein can bind
to a DNA sequence at or near the position of RNA polymerase binding to enhance or
prevent transcription. Alternatively, a regulatory protein (e.g., an inducer or repressor
molecule) can directly or indirectly interact with RNA polymerase itself to change its
specificity for recognition and binding to a DNA sequence of the GDF promoter. In
either case, specific sequence(s) within the GDF promoter are involved in the
mechanism of regulation.

The term "recombinant DNA molecule" is used herein to distinguish DNA
molecules in which heterologous DNA sequences have been artificially cleaved from
their natural source or ligated together by the techniques of genetic engineering, for
example, by in vitro use of restriction enzymes or ligation using DNA ligase.

GDF promoters of the invention can be identified and cloned from their natural
sources (e.g., the genome of a human, mouse, chicken, pig, cow, fish or sheep). The
process of cloning a DNA fragment involves excision and isolation of the DNA
fragment from its natural source, insertion of the DNA fragment into a recombinant
vector and incorporation of the vector into a microorganism or cell where the vector and
inserted DNA fragment are replicated during proliferation of the microorganism or cell.
The term "cloned DNA fragment" or "cloned DNA molecule" refers to a DNA fragment
or molecule produced by the process of cloning, as well as copies (or clones) of the
DNA fragment or molecule replicated therefrom. Standard techniques for cloning, DNA
isolation, DNA amplification and purification, enzymatic reactions (e.g., involving DNA
ligase, polymerase, or restriction endonucleases) and various separation techniques,
which are known and commonly employed by those skilled in the art, can be used in the
present invention. A number of these standard techniques are described in: Maniatis et
al. (1982) Molecular Cloning, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY;
Wu (ed.) (1979) Meth. Enzymol. 68; Wu et al. (eds.) (1983) Meth. Enzymol. 100 & 101;
Grossman and Moldave (eds.) (1980) Meth. Enzymol. 65; Miller (ed.) (1972) Exp. Mol.
Genetics, Cold Spring Harbor, NY; Old and Primrose (1981) Principles of Gene
Manipulation, Univ. of Cal. Press, Berkeley; Schleif and Wensink (1982) Practical
Methods in Molecular Biology; Glover (ed.) (1985) DNA Cloning, Vols. I and II, IRL
Press, Oxford, UK; Setlow and Hollaender (1979) Genetic Engineering: Principles and
Methods, Vols I, Plenum Press, NY; which are incorporated by reference in their
entirety herein. Abbreviations, where employed, are those deemed standard in the field
and commonly used in professional journals such as those cited herein.

Expression of a gene requires both transcription of DNA into mRNA and the
subsequent translation of the mRNA into a protein. Because gene regulation usually
occurs at the level of transcription, transcriptional regulation and activity of GDF
promoters of the present invention can be assayed by quantitation of gene products. For
example, promoter regulation and activity can be quantitatively studied by the fusion of
the easily assayable E. coli lacZ gene sequence to a heterologous promoter (Casadaban
and Cohen (1980) J. Mol. Biol. 138:179-207). Alternatively, the genes coding for
chloramphenicol acetyl transferase (CAT), green fluorescent protein (GFP) and
luciferase can be used to detect activity of a promoter. Such genes are termed "reporter"
genes which, when combined with a given promoter (usually a heterologous promoter),
provide a ready assay for promoter activity.

GDF promoters and GDF promoter elements of the invention (i.e., selected
sequences within GDF-8 promoters involved in their regulatory function) may be
employed in the form of single or multiple units, in numerous combinations and
organizations, in forward or reverse orientations, and the like. In the context of multiple
unit embodiments and/or in embodiments which incorporate both positive and negative
control elements, there is no requirement that such units be arranged in an adjacent
head-to-head or head-to-tail construction since the improved regulation capability of
such multiple units is conferred virtually independent of the location of such multiple
sequences with respect to each other. Moreover, there is no requirement that each unit
comprise the same positive or negative element. Such sequences can be located
upstream of and sufficiently proximal to a transcription initiation site, in the intron or
downstream of the gene of interest, to confer a desired regulatory effect. In addition,
GDF promoters and GDF promoter elements of the invention can be used in numerous
combinations with promoters and regulatory elements of other genes to achieve
the desired enhancement or repression of the expression of any gene of interest.

Accordingly, in one embodiment of the invention, the GDF promoter is used to
regulate transcription of a heterologous structural gene by simply obtaining the
structural gene and inserting one or more copies of the GDF promoter upstream of the
gene's transcription initiation site. Additionally, as is known in the art, it is generally
desirable to include TATA-box sequences upstream of and proximal to the transcription
initiation site of the heterologous structural gene. Such sequences may be synthesized
and inserted in the same manner as the novel control sequences of the invention.
Alternatively, the TATA sequences naturally associated with the heterologous structural
gene can be employed. Generally, the TATA sequences are located between about 20
and 30 nucleotides upstream of transcription initiation.

Numerous methods are known in the art for precisely inserting selected
nucleotide sequences, at selected points, within larger sequences. In one method, the
desired control sequences, or combinations of sequences, are synthesized and restriction
site linker fragments added to the control sequence termini. This allows for ready
insertion of the control sequences into compatible restriction sites within upstream
regions. Alternatively, synthesized control sequences can be ligated directly to selected
regions. In addition, site specific mutagenesis can be employed to fashion restriction
sites into which control sequences may be inserted, in the case where no convenient
restriction sites are found at a desired insertion site.

GDF promoters of the present invention can be beneficially employed in the
context of any heterologous gene, with or without additional homologous or
heterologous control or promotion sequences. In a particular embodiment, the present
invention provides the GDF-8 gene promoter and optionally other regulatory sequences
which function in the induction of GDF-8 expression in response to factors which are
known to induce GDF-8 expression.

Any suitable reporter gene can be used to measure the activity of a GDF
promoter element of the invention. For example, in the examples described below, a
GDF-8 promoter-luciferase reporter construct was used. The GDF-8 promoter-luciferase
reporter construct was active in two cell lines (Figures 1A-1B), RD (human
embryonal rhabdomyosarcoma) and A673 (human rhabdomyosarcoma). Maximum
activity was observed with the 0.65 Kb 5' flanking fragment (5-8% of the SV40
luciferase reporter plasmid, pGL3-control). No significant luciferase activities were
detected in the HepG2 (human liver hepatoblastoma), HT29 (human colon
adenocarcinoma), PA-1 (human ovarian teratocarcinoma), and MCF-7 (human breast
adenocarcinoma) cell lines. The results of the transfection assays suggest that 1) a repressor
element(s) exists between 8.5 Kb and 0.65 Kb upstream of the human GDF-8 gene, 2)
the 0.65 Kb fragment contains the minimal promoter, and 3) the minimal promoter is
tissue specific and appears to be active in muscle cell lines only.
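
For illustration, the way reporter readings of this kind are typically reduced to "percent of control" values (luciferase activity normalized to β-galactosidase activity, then expressed relative to a control construct, as in the descriptions of Figures 3B, 4B and 5B) can be sketched as follows. This is a minimal sketch, not the procedure actually used; the function name and all numerical readings are hypothetical and are not data from the experiments.

```python
def relative_activities(readings, control):
    """Return luciferase activity as percent of a control construct.

    readings: dict mapping construct name -> (luciferase, beta_gal) raw values.
    control:  name of the construct defined as 100%.
    Each luciferase value is first divided by the co-transfected
    beta-galactosidase value to correct for transfection efficiency.
    """
    normalized = {}
    for name, (luciferase, beta_gal) in readings.items():
        if beta_gal <= 0:
            raise ValueError(f"beta-galactosidase reading for {name} must be positive")
        normalized[name] = luciferase / beta_gal
    reference = normalized[control]
    if reference == 0:
        raise ValueError("control construct has zero normalized activity")
    return {name: 100.0 * value / reference for name, value in normalized.items()}


# Hypothetical raw readings (arbitrary units), for illustration only.
example_readings = {
    "pGL3-0.65": (1200.0, 3.0),
    "pGL3-0.3": (600.0, 2.0),
    "pGL3-basic": (40.0, 4.0),
}
for construct, percent in relative_activities(example_readings, "pGL3-0.65").items():
    print(f"{construct}: {percent:.1f}% of control")
```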

In yet another embodiment, the present invention provides a method of screening
for a compound which binds to a GDF promoter (or a portion thereof), such as a GDF-8
promoter, and modulates expression of a GDF gene (e.g., GDF-8) or a heterologous
gene. As used herein, the term "modulate" includes both inhibition and stimulation of
GDF expression. Moreover, the term "inhibition" is intended to include both complete
and partial inhibition of GDF expression. In various embodiments, GDF expression is
inhibited to a level at least 1.2-fold, 1.5-fold, 1.8-fold, 2-fold, 2.5-fold, 3-fold, 4-fold or
5-fold lower than the wild type level of GDF-8 expression. In further embodiments,
GDF expression is inhibited by at least 10%, 20%, 30%, 40%, 50%, 75% or 100%.
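
The fold-reduction and percent-inhibition thresholds above are stated separately. One way to relate the two scales, offered here only as an interpretive sketch and not as a definition from this specification, is to read "n-fold lower" as expression reduced to 1/n of the wild type level. The function names below are hypothetical.

```python
import math


def percent_inhibition_from_fold(fold_lower: float) -> float:
    """Percent inhibition if expression falls to 1/fold_lower of wild type."""
    if fold_lower < 1:
        raise ValueError("fold_lower must be at least 1")
    return 100.0 * (fold_lower - 1.0) / fold_lower


def fold_from_percent_inhibition(percent: float) -> float:
    """Fold reduction corresponding to a given percent inhibition."""
    if not 0 <= percent <= 100:
        raise ValueError("percent must be between 0 and 100")
    if percent == 100:
        return math.inf  # complete inhibition: no residual expression
    return 100.0 / (100.0 - percent)


for fold in (1.2, 1.5, 1.8, 2, 2.5, 3, 4, 5):
    print(f"{fold}-fold lower ~ {percent_inhibition_from_fold(fold):.1f}% inhibition")
```

Under this reading, 2-fold lower corresponds to 50% inhibition and 4-fold lower to 75% inhibition, while 100% inhibition has no finite fold equivalent.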

To test compounds for their ability to modulate GDF transcription and/or
expression, the compound can be tested for its ability to increase or stimulate the
transcription and/or expression or to decrease or inhibit the transcription and/or
expression of a reporter gene (e.g., the SV40 β-galactosidase reporter gene) which is
operatively linked to a GDF promoter of the invention, compared to that of a control
reporter. Effects of the test compound are determined by changes in reporter gene
activity. For example, a stable cell line containing a GDF-8 promoter operatively linked
to a reporter gene, such as the luciferase gene, can be used in any screening
method well known in the art for detecting expression (e.g., luciferase assays, CAT assays, or
Green Fluorescent Protein (GFP) assays). However, the invention is not restricted to
these suggested screening methods.

Transcription factors that bind to GDF promoters of the invention also can be
characterized using gel mobility shift assays and these transcription factors can be
cloned using these specific sequences as probes in screening expression libraries.
Alternatively, second generation reporter constructs containing multiple copies of the
following transcription factor binding sequences: CAAATG, CAGACA or GACAGC
and a minimal promoter (0.2 kb upstream of the initiating ATG), or a combination of these
sequences and a minimal promoter, can be used in high-throughput screening assays to
identify inhibitors specific for GDF genes which operate by binding to GDF promoter
sequences. After these transcription factors have been identified, they, too, may be used
as targets for identifying other inhibitors. For example, these transcription factors can
be used as "bait proteins" in a two-hybrid assay or three-hybrid assay (see, e.g., U.S.
Patent No. 5,283,317; Zervos et al. (1993) Cell 72:223-232; Madura et al. (1993) J. Biol.
Chem. 268:12046-12054; Bartel et al. (1993) Biotechniques 14:920-924; Iwabuchi et al.
(1993) Oncogene 8:1693-1696; and Brent WO94/10300), or as protein probes in the
screening of expression libraries to identify other proteins which bind to or interact with
the transcription factors. Such transcription factor-binding proteins are also likely to act
as modulators, e.g., inhibitors of GDF-8 expression.
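
As an illustrative aid, locating the three binding sequences named above within a promoter sequence, and writing out a concatemer of one of them, can be sketched in Python. The helper names and the toy sequence are hypothetical (the toy sequence is not taken from any SEQ ID NO), and the search covers only the strand as given, which is an assumption of this sketch.

```python
MOTIFS = ("CAAATG", "CAGACA", "GACAGC")  # binding sequences named in the text


def find_motifs(sequence: str, motifs=MOTIFS) -> dict:
    """Return 0-based start positions of each motif, allowing overlaps."""
    seq = sequence.upper()
    hits = {}
    for motif in motifs:
        positions = []
        start = seq.find(motif)
        while start != -1:
            positions.append(start)
            # Advance by one base so overlapping occurrences are also found.
            start = seq.find(motif, start + 1)
        hits[motif] = positions
    return hits


def concatemer(motif: str, copies: int, spacer: str = "") -> str:
    """Head-to-tail repeat of a motif, optionally separated by a spacer."""
    if copies < 1:
        raise ValueError("copies must be at least 1")
    return spacer.join([motif] * copies)


toy_promoter = "TTCAAATGAACAGACAGCTTCAAATG"  # made-up sequence for illustration
print(find_motifs(toy_promoter))
print(concatemer("CAAATG", 3))
```

In the toy sequence, CAGACA and GACAGC overlap within the run CAGACAGC, which is why the search advances one base at a time rather than skipping past each hit.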

In one embodiment, the invention provides a GDF-8 promoter, or a portion
thereof, which can be operatively linked to a gene of interest (GOI) and expressed in a
tissue-specific manner. GDF-8 promoter activity is specific to muscle tissue. Therefore,
the GDF-8 promoter can be used to express any GOI for which expression is desired in
muscle tissue. Examples of such genes include, but are not limited to, GDF-8 itself,
dystrophin, growth factors, and genes coding for tumor or pathogen antigens. Genes which
express proteins useful in vaccination are also encompassed, including viral, tumor,
pathogenic, or bacterial antigens, specifically AIDS envelope protein gp120. However,
this is not intended to be a limiting list. Any gene which expresses a protein of interest
may be employed in the methods of the invention.

Expressing a protein of interest specifically in muscle tissue is highly preferred
in the area of gene therapy due to the amount of muscle mass in the body and the ease with
which muscle can express foreign genes. Accordingly, the GDF-8 promoter of the
invention can be used in gene therapy vectors to direct expression of a gene of interest in
muscle tissue. Gene therapy vectors including the GDF-8 promoter of the invention can
be delivered to a subject by, for example, intravenous injection, local administration (see
U.S. Patent 5,328,470), stereotactic injection (see e.g., Chen et al. (1994) Proc. Natl.
Acad. Sci. USA 91:3054-3057), or direct intramuscular injection (as described in, for
example, U.S. Patent Nos. 5,580,859 and 5,589,466). The gene therapy vectors
containing the GDF-8 promoter of the invention can be used for the treatment of a
muscle-associated disorder such as cancer, muscular dystrophy, spinal cord injury,
traumatic injury, congestive obstructive pulmonary disease, AIDS or cachexia; or for the
treatment of obesity and related disorders, e.g., diabetes; or disorders related to
abnormal proliferation of adipocytes.

This invention is further illustrated by the following examples which should not 
be construed as limiting. The contents of all references, patents and published patent 
applications cited throughout this application are incorporated herein by reference. 

EXAMPLES 

Example 1: Screening of human genomic library 

A Stratagene human genomic library (HT1080) was screened using a 745 bp 
EcoRI-HindIII human GDF-8 cDNA probe (described in, for example, U.S. patent 
application 08/525,596). Three genomic clones were isolated using methods described 
in Current Protocols in Molecular Biology (Eds. Ausubel et al., John Wiley & Sons, Inc., 
1996). An 8.5 kilobase (Kb) fragment containing the 5' flanking region of human GDF-8 
was subcloned into the luciferase reporter construct pGL3-basic (Promega). This 8.5 
Kb fragment was truncated by restriction digestion or by PCR to generate six 
additional clones: 5.3 Kb, 3.6 Kb, 0.65 Kb, 0.3 Kb, 0.2 Kb, and 0.1 Kb. Approximately 
1 Kb upstream of the initiating ATG has been sequenced. The approximate 
transcription start site, based on the positions of the CCAAT and TATAA boxes, is about 
100 bases upstream of the ATG. Therefore, the luciferase clone containing 0.1 Kb 
contains only the 5' untranslated region. 
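The relationship between construct length and the approximate transcription start site can be illustrated with a short calculation. The following is a minimal sketch only: the function and variable names are hypothetical, and it assumes that every construct ends at the initiating ATG and that the start site lies exactly 100 bases upstream, whereas the text gives that figure only as an approximation.

```python
# Construct lengths (bp upstream of the initiating ATG) listed in Example 1.
CONSTRUCT_LENGTHS_BP = [8500, 5300, 3600, 650, 300, 200, 100]

# Approximate distance from the transcription start site (TSS) to the ATG.
# Assumption: treated here as exactly 100 bp for illustration.
TSS_TO_ATG_BP = 100


def promoter_bp_upstream_of_tss(construct_bp, tss_to_atg_bp=TSS_TO_ATG_BP):
    """Return how many bp of a construct lie upstream of the TSS."""
    return max(0, construct_bp - tss_to_atg_bp)


def summarize(lengths=CONSTRUCT_LENGTHS_BP):
    """Map each construct length to the bp of 5' flanking sequence beyond the TSS."""
    return {bp: promoter_bp_upstream_of_tss(bp) for bp in lengths}


if __name__ == "__main__":
    for bp, flank in summarize().items():
        label = "5' UTR only" if flank == 0 else f"{flank} bp upstream of TSS"
        print(f"{bp / 1000:g} Kb construct: {label}")
```

Under these assumptions the 0.1 Kb construct carries no sequence upstream of the start site, consistent with its description as containing only the 5' untranslated region.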

Example 2: Transfection Assays 

Transient transfection assays were performed using FuGENE (Boehringer 
Mannheim) according to the manufacturer's protocol. Six different human cell lines, 
RD, A673, HepG2, HT29, PA-1, and MCF-7 (ATCC, Rockville, Maryland), were used 
in the transfection assays. On the day before transfection, 1.5 x 10^5 cells in 2 ml of 
DMEM with 10% FBS were seeded in 35 mm tissue culture dishes. The cells were 
incubated overnight until they were 50-70% confluent. For each transfection, 1.0 µg 
of luciferase reporter plasmid, 0.1 µg of β-galactosidase reporter plasmid, pSV-β-gal 
(Promega), and 5 µg of FuGENE were used. A promoter-less luciferase vector (pGL3-basic) 
(Promega) and an SV40 luciferase vector (pGL3-control) (Promega) were used as 
controls. Luciferase and β-galactosidase activities were determined 24 hours 
post-transfection using the Dual Light chemiluminescent reporter gene assay kit (Tropix, 
Inc.) according to the manufacturer's protocol. The relative activities of the luciferase 
reporter constructs were normalized to the β-galactosidase activity. 
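The normalization step described above can be sketched as follows. This is an illustrative calculation only: the function names and all raw readings are made up, and the exact arithmetic used to produce the reported figures is not stated in the text. The sketch assumes a simple ratio of luciferase signal to co-transfected β-galactosidase signal, optionally expressed relative to a chosen reference construct.

```python
def relative_activity(luciferase, beta_gal):
    """Luciferase signal divided by the co-transfected beta-galactosidase signal."""
    if beta_gal <= 0:
        raise ValueError("beta-galactosidase reading must be positive")
    return luciferase / beta_gal


def percent_of_reference(readings, reference):
    """Express each construct's normalized activity as a percentage of a reference."""
    ref = relative_activity(*readings[reference])
    return {
        name: 100.0 * relative_activity(luc, gal) / ref
        for name, (luc, gal) in readings.items()
    }


# Made-up raw readings (luciferase, beta-gal) in arbitrary light units.
example_readings = {
    "pGL3-basic": (200.0, 1000.0),
    "pGL3-0.65": (8000.0, 2000.0),
    "pGL3-control": (30000.0, 1500.0),
}

if __name__ == "__main__":
    for name, pct in percent_of_reference(example_readings, "pGL3-0.65").items():
        print(f"{name}: {pct:.1f}% of reference")
```

Dividing by the β-galactosidase signal corrects for dish-to-dish differences in transfection efficiency, so that constructs can be compared with one another.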

Example 3: GDF-8 Promoter Mutants 

Additional truncations of the GDF-8 promoter were made using PCR to generate 
DNA fragments containing 0.65 Kb, 0.44 Kb, 0.31 Kb, 0.29 Kb, 0.25 Kb, 0.21 Kb, and 
0.1 Kb of sequence upstream of the initiating ATG codon (see Figures 2 and 3A). These 
DNA fragments were then subcloned into the luciferase reporter plasmid pGL3-basic 
(Promega). The constructs (0.65 Kb to 0.1 Kb) were transfected into RD cells and the 
expression of these constructs was determined using luciferase assays (see Figures 3A 
and 3B). Deleting the region from 0.31 Kb to 0.29 Kb decreased the luciferase activity 
by about 40%; deleting from 0.31 Kb to 0.25 Kb decreased the luciferase activity by 
about 60%; and deleting from 0.31 Kb to 0.21 Kb decreased the luciferase activity by 
about 90%. Examination of the regions from 0.31 Kb to 0.29 Kb, and 0.29 Kb to 0.25 Kb, 
revealed the sequences CAAATG (potential E-box) and CAGACA (potential Smad 3 and 4 
binding sequence), respectively. Although examination of the 0.25 Kb to 0.21 Kb 
region did not reveal any known transcription factor binding sites, the deletion results 
suggest the possibility of a cis-element in this region. 
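Examination of a promoter region for candidate sites of this kind can be sketched in a few lines. The example below is a minimal illustration, not the method used to examine the GDF-8 promoter: it searches only for exact matches on the given strand, the function name is hypothetical, and the demonstration sequence is invented and is not GDF-8 promoter sequence.

```python
import re

# Candidate sites named in Example 3.
MOTIFS = {
    "E-box candidate": "CAAATG",
    "Smad 3/4 candidate": "CAGACA",
}


def find_motifs(sequence, motifs=MOTIFS):
    """Return {motif name: [0-based start positions]} for exact matches."""
    seq = sequence.upper()
    hits = {}
    for name, motif in motifs.items():
        # A lookahead pattern reports overlapping occurrences as well.
        pattern = f"(?={re.escape(motif)})"
        hits[name] = [m.start() for m in re.finditer(pattern, seq)]
    return hits


# Invented sequence for illustration only; NOT the GDF-8 promoter.
demo_sequence = "ttgCAAATGacctgCAGACAgg"

if __name__ == "__main__":
    print(find_motifs(demo_sequence))
```

A fuller search would also scan the reverse complement and allow degenerate consensus sequences; both are omitted here for brevity.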

To determine whether the decrease in luciferase activity from the deletion 
constructs was due to the regulatory role of the CAAATG, CAGACA and GACAGC 
sequences in GDF-8 expression, or due to the physical truncation of the promoter 
region, clustered site-directed mutagenesis was performed using the QuikChange 
site-directed mutagenesis kit (Stratagene). The following mutants were created: 
CAAATG -> AGATCT; CAGACA -> AGATCT; and GACAGC -> AGATCT. In 
addition, GDF-8 promoter reporter constructs containing multiple clustered mutations 
were generated. The mutated promoter constructs were transfected into RD cells 
and assayed for luciferase activity (see Figures 4A and 4B). Mutating the CAAATG 
region resulted in an approximately 25% decrease in luciferase activity; mutating the 
CAGACA region resulted in an increase in luciferase activity; and mutating the 
GACAGC region resulted in an approximately 40% decrease in luciferase activity. 
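The clustered substitutions listed above can be modeled in silico as a same-length replacement of each six-base site. The following is a minimal sketch under stated assumptions: the function names are hypothetical, the template sequence is invented, and the code models only the resulting sequence, not the primer design or the mutagenesis procedure itself.

```python
# Replacement sequence used for all three sites in Example 3.
REPLACEMENT = "AGATCT"


def cluster_mutate(sequence, site, replacement=REPLACEMENT):
    """Replace every exact occurrence of `site` with a same-length `replacement`."""
    if len(site) != len(replacement):
        raise ValueError("site and replacement must have equal length")
    seq = sequence.upper()
    if site not in seq:
        raise ValueError(f"{site} not found in sequence")
    return seq.replace(site, replacement)


def multi_mutant(sequence, sites):
    """Apply cluster_mutate for several sites in turn (e.g., a double mutant)."""
    for site in sites:
        sequence = cluster_mutate(sequence, site)
    return sequence


# Invented template for illustration only; NOT the GDF-8 promoter.
demo_template = "TTCAAATGCCGACAGCTT"

if __name__ == "__main__":
    print(cluster_mutate(demo_template, "CAAATG"))
    print(multi_mutant(demo_template, ["CAAATG", "GACAGC"]))
```

Because each substitution preserves length, the spacing of the remaining promoter elements is unchanged, which is what allows a sequence-specific effect to be separated from the effect of physical truncation.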

These results suggest that not only does the physical truncation of the promoter 
region have an effect on luciferase expression, but more importantly, that the sequences 
CAAATG, CAGACA and GACAGC play a regulatory role in expression. The two 
sequences, CAAATG and GACAGC, also appear to act synergistically, as demonstrated 
by a 60% decrease in luciferase activity in the double mutant. These results also 
suggest that there are multiple regulatory regions (e.g., transcription factor binding 
regions) in the GDF-8 promoter. Therefore, GDF-8 promoter-luciferase gene reporter 
constructs can be used in high-throughput screening (HTS) for inhibitors of GDF-8 
expression, and such screening may yield inhibitors acting at different regulatory sites. 

Example 4: Generation of Expression Constructs Suitable for Use in Screening for 
Regulatory Compounds of GDF-8 Expression 

The mutation of the sequences CAAATG, CAGACA and GACAGC results in 
either inhibiting (for CAAATG and GACAGC) or enhancing (for CAGACA) the 
expression of the GDF-8 promoter construct pGL3-0.65. This indicates that these 
sequences are involved in the regulation of GDF-8 expression and suggests that these 
sequences are recognized by transcription factors. To demonstrate this, luciferase 
expression plasmids containing a minimal promoter sequence (pGL3-0.21, containing 
the region -1 to -207, Figure 2) and one or more copies of the above regulatory 
sequences upstream of the minimal promoter sequence were constructed. Figure 5A 
shows the double-stranded oligonucleotide sequences containing the sequences 
CAAATG and GACAGC and their flanking sequences used in the generation of 
concatemers (multiple copies of the above-identified sequences). Expression of the 
luciferase reporter constructs containing the CAAATG sequence was dependent on the 
number of copies of this sequence contained within the construct, while the expression 
of the luciferase reporter constructs remained at 100% of the control regardless of the 
number of copies of the GACAGC sequence (Figure 5B). These transfection results 
indicate that these expression plasmids can be used in screening protocols to identify 
compounds that regulate specific transcription factors interacting with the above 
regulatory sequences. 
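The layout of a concatemer reporter, with one or more copies of a regulatory sequence placed upstream of a minimal promoter, can be sketched as simple string assembly. This is an illustration under stated assumptions: the function names are hypothetical, the minimal promoter string is a placeholder, and the flanking sequences of the oligonucleotides in Figure 5A are not reproduced here, so the bare six-base elements are used instead.

```python
def build_reporter(element, copies, minimal_promoter, spacer=""):
    """Place `copies` tandem repeats of `element` upstream of a minimal promoter."""
    if copies < 0:
        raise ValueError("copies must be zero or greater")
    concatemer = spacer.join([element] * copies)
    return concatemer + minimal_promoter


def count_copies(construct, element):
    """Count non-overlapping exact occurrences of `element` in a construct."""
    return construct.upper().count(element.upper())


# Placeholder standing in for the minimal promoter; NOT real sequence.
PLACEHOLDER_MINIMAL_PROMOTER = "NNNNNNNNNN"

if __name__ == "__main__":
    for n in (1, 2, 4):
        construct = build_reporter("CAAATG", n, PLACEHOLDER_MINIMAL_PROMOTER)
        print(n, construct, count_copies(construct, "CAAATG"))
```

Varying the copy number while holding the minimal promoter constant is what allows reporter expression to be related to the number of copies of a given element.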

The approaches described in this Example, as well as Examples 1, 2 and 3 above, 
can also be used to identify any other unknown regulatory sequences in the human 
GDF-8 promoter and other GDF-8 promoters, including but not limited to mouse, pig or 
chicken GDF-8 promoters. In addition, the various expression constructs described in 
this Example, as well as Examples 1, 2 and 3 above, can be used to generate transgenic 
animals to demonstrate promoter and/or regulatory activity in vivo. 



Example 5: Generation of Cell Lines Suitable for Use in Screening for Regulatory 
Compounds of GDF-8 Expression 

Stable cell lines containing the pGL3-0.65 plasmid sequence were generated 
by transfecting RD cells with the pGL3-0.65 plasmid and selecting for stable cell clones 
under G418 selection. The expression profiles of two such stable cell clones 
are shown in Figure 6. Treatment of these cell clones with TGF-β or TNFα 
down-regulated the luciferase reporter gene expression, demonstrating that the exogenous 
GDF-8 promoter can be regulated and that these cell clones are useful in the screening 
and identification of GDF-8 expression regulatory compounds. Stable cell lines 
containing CAAATG and GACAGC concatemer reporter plasmids (described in 
Example 4) can also be generated for use in the screening and identification of GDF-8 
expression regulatory compounds, e.g., compounds that specifically affect regulatory 
proteins (such as transcription factors) binding to these sequences. 

Example 6: Regulatory Sequences in GDF-8 Promoters Are Conserved Among 
Various Species 

Additional promoter sequences from chicken and mouse GDF-8 were obtained 
by sequencing a chicken GDF-8 genomic clone, isolated by screening a White Leghorn 
Chicken genomic library (Stratagene), and by screening a mouse GDF-8 genomic clone, 
kindly provided by Dr. Se-Jin Lee at Johns Hopkins University (see McPherron et al. 
(1997) Nature 387:83-90). The nucleotide sequence for the pig GDF-8 gene can be 
obtained from GenBank Accession numbers AJ133580 and AF093798. 

A comparison of human, mouse, chicken and pig GDF-8 promoter sequences 
(160 nucleotides upstream of the TATAA box) is shown in Figure 8 and reveals a high 
level of sequence homology between the four species (see Table 1). In particular, the 
regulatory sequences CAAATG and GACAGC are present within this region in all four 
species. The high degree of sequence identity in the promoter regions of these species 
and the conservation of these regulatory sequences in this region indicate that the same 
transcription factors that bind to the human promoter also can recognize regulatory 
sequences in pig, mouse and chicken. Therefore, the same strategies that can be used to 
identify regulatory sequences in the human promoter can also be used to identify 
regulatory sequences in the promoter regions of mouse, pig and chicken GDF-8. 

The activity of the promoter and regulatory elements of the mouse, pig and 
chicken GDF-8 genes can be determined and analyzed using methods described for the 
human GDF-8 gene in Examples 1, 2, 3 and 4. For example, cells and cell lines of 
mouse, pig and chicken origin can be used in the in vivo analysis of mouse, pig and 
chicken GDF-8 promoter and regulatory elements, respectively. Since the GDF-8 
promoter region is highly conserved among the different species, in addition to using 
cells and cell lines from homologous species in analyzing promoter and regulatory 
element activities in in vivo assays, cells and cell lines of heterologous species may also 
be used. Additionally, the GDF-8 promoter and regulatory element activities of any 
species including, but not limited to, human, mouse, pig and chicken, can be analyzed in 
vivo by the generation of transgenic animals using reporter vectors containing the GDF-8 
promoter, a portion of the promoter, regulatory elements and/or combinations thereof. 



Table 1 

% Sequence Homology    Human    Pig    Mouse    Chicken 
Human                  100      97     96       79 
Pig                             100    95       78 
Mouse                                  100      76 
Chicken                                         100 
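Pairwise percentages of the kind shown in Table 1 can be computed from aligned sequences as sketched below. The method used to derive the values in Table 1 is not stated, so this is an assumption-laden illustration: it computes simple percent identity over pre-aligned sequences of equal length, the function names are hypothetical, and the toy sequences are invented and are not the promoter sequences of Figure 8.

```python
from itertools import combinations


def percent_identity(a, b):
    """Percent of aligned columns with identical bases; gap-gap columns are skipped."""
    if len(a) != len(b):
        raise ValueError("sequences must be pre-aligned to equal length")
    columns = [
        (x, y) for x, y in zip(a.upper(), b.upper())
        if not (x == "-" and y == "-")
    ]
    if not columns:
        raise ValueError("no aligned columns to compare")
    matches = sum(1 for x, y in columns if x == y)
    return 100.0 * matches / len(columns)


def identity_matrix(aligned):
    """Return {(name1, name2): percent identity} for every pair of sequences."""
    return {
        (p, q): percent_identity(aligned[p], aligned[q])
        for p, q in combinations(aligned, 2)
    }


# Invented, pre-aligned toy sequences; NOT the sequences compared in Table 1.
toy_alignment = {
    "species_A": "CAAATGGACAGC",
    "species_B": "CAAATGGACAGT",
    "species_C": "CAAATCGACAGT",
}

if __name__ == "__main__":
    for pair, pct in identity_matrix(toy_alignment).items():
        print(pair, f"{pct:.1f}%")
```

Only the upper triangle of pairs is computed, mirroring the layout of Table 1, since percent identity is symmetric.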



Equivalents 

Those skilled in the art will recognize, or be able to ascertain using no more than 
routine experimentation, many equivalents to the specific embodiments of the invention 
described herein. Such equivalents are intended to be encompassed by the following 
claims. 